Picture for Ivan Vulić

Ivan Vulić

Thinking in Frames: How Visual Context and Test-Time Scaling Empower Video Reasoning

Add code
Jan 28, 2026
Viaarxiv icon

Value of Information: A Framework for Human-Agent Communication

Add code
Jan 10, 2026
Viaarxiv icon

ReCoVeR the Target Language: Language Steering without Sacrificing Task Performance

Add code
Sep 18, 2025
Viaarxiv icon

11Plus-Bench: Demystifying Multimodal LLM Spatial Reasoning with Cognitive-Inspired Analysis

Add code
Aug 27, 2025
Viaarxiv icon

Quantifying Language Disparities in Multilingual Large Language Models

Add code
Aug 23, 2025
Viaarxiv icon

RAVENEA: A Benchmark for Multimodal Retrieval-Augmented Visual Culture Understanding

Add code
May 20, 2025
Viaarxiv icon

Visual Planning: Let's Think Only with Images

Add code
May 16, 2025
Viaarxiv icon

A Call for New Recipes to Enhance Spatial Reasoning in MLLMs

Add code
Apr 21, 2025
Viaarxiv icon

Cross-Tokenizer Distillation via Approximate Likelihood Matching

Add code
Mar 27, 2025
Viaarxiv icon

Training Plug-n-Play Knowledge Modules with Deep Context Distillation

Add code
Mar 11, 2025
Viaarxiv icon